non-homologous recombination, and allowing over-expression of the endogenous
gene in the cell.



The cell containing the vector is screened for expression of the gene.
The cell over-expressing the gene can be cultured in vitro so as to produce
desirable amounts of the gene product of the endogenous gene whose expression
has been activated or increased. The gene product can then be isolated and
purified.

Alternatively, the cell can be allowed to express the desired gene product
in vivo.



The vector construct can consist essentially of the transcriptional
regulatory sequence and the splice donor sequence.

Any of the vector constructs used in the methods described herein can also
include a secretion signal sequence. The secretion signal sequence is arranged
in the construct so that it will be operably linked to the activated endogenous
protein. Thereby, secretion of the protein of interest occurs in the cell and
purification of that protein is facilitated. Accordingly, the methods can
include a step in which the protein expression product is secreted from the
cell.

The invention also encompasses cells made by any of the above methods.
The invention encompasses cells containing the vector constructs, cells in
which the vector constructs have integrated, and cells which are
over-expressing desired gene products from an endogenous gene, over-expression
being driven by the introduced transcriptional regulatory sequence.

The cells can be isolated and cloned.

The methods can be carried out in any cell of eukaryotic origin, such as
fungal, plant or animal. Preferred embodiments include vertebrates and
particularly mammals, and more particularly, humans.

A single cell made by the methods described above can over-express a
single gene or more than one gene. More than one gene in a cell can be
activated by the integration of a single type of construct into multiple
locations in the genome. Similarly, more than one gene in a cell can be
activated by the integration of multiple constructs (i.e., more than one type
of construct) into multiple locations in the genome. Therefore, a cell can
contain only one type of vector construct or different types of constructs,
each capable of activating an endogenous gene.

The invention is also directed to methods for making the cells described
above by one or more of the following: introducing one or more of the vector
constructs; allowing the introduced construct(s) to integrate into the genome
of the cell by non-homologous recombination; allowing over-expression of one
or more endogenous genes in the cell; and isolating and cloning the cell.



The invention also encompasses methods for using the cells described
above to over-express a gene that has been characterized (for example,
sequenced), uncharacterized (for example, a gene whose function is known but
which has not been cloned or sequenced), or a gene whose existence was, prior
to over-expression, unknown. The cells can be used to provide desired amounts
of a gene product in vitro or in vivo. The gene product can then be isolated
and purified if desired. It can be purified by cell lysis or from the growth
medium (as when the vector contains a secretion signal sequence).






The invention also encompasses libraries of cells made by the above
described methods. A library can encompass all of the clones from a single
transfection experiment or a subset of clones from a single transfection
experiment. The subset can over-express the same gene or more than one gene,
for example, a class of genes. The transfection can have been done with a
single construct or with more than one construct.

A library can also be formed by combining all of the recombinant cells
from two or more transfection experiments, by combining one or more subsets of
cells from a single transfection experiment or by combining subsets of cells
from separate transfection experiments. The resulting library can express the
same gene, or more than one gene, for example, a class of genes. Again, in each
of these individual transfections, a unique construct or more than one
construct can be used.

The invention accordingly is also directed to methods of using libraries
of cells to over-express endogenous genes. The library is screened for the
expression of the gene and cells are selected that express the desired gene
product. The cell can then be used to purify the gene product for subsequent
use. Expression in the cell can occur by culturing the cell in vitro or by
allowing the cell to express the gene in vivo.
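
As an illustration only, the library formation and screening described above
can be modeled in a few lines of Python. This is a minimal sketch: the Clone
record, its field names, and the gene labels are hypothetical stand-ins and are
not part of the disclosed methods.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Clone:
    """Hypothetical record for one recombinant clone (names are illustrative)."""
    clone_id: str
    transfection: str            # transfection experiment that produced the clone
    expressed_genes: frozenset   # endogenous genes the clone over-expresses


def combine_library(*subsets):
    """Form a library by pooling clones from one or more subsets or experiments."""
    library = set()
    for subset in subsets:
        library.update(subset)
    return library


def screen(library, gene):
    """Select the clones in the library that express the desired gene."""
    return {clone for clone in library if gene in clone.expressed_genes}


# Two assumed transfection experiments with made-up results.
exp1 = {Clone("c1", "T1", frozenset({"geneA"})),
        Clone("c2", "T1", frozenset({"geneB"}))}
exp2 = {Clone("c3", "T2", frozenset({"geneA", "geneC"}))}

library = combine_library(exp1, exp2)
hits = screen(library, "geneA")
print(sorted(clone.clone_id for clone in hits))  # prints ['c1', 'c3']
```

A set is used so that a clone pooled from overlapping subsets is counted once.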

In preferred embodiments of the invention, the methods [illegible]
wherein the expression product is purified. In highly preferred embodiments,
the cells expressing the endogenous gene product are cultured so as to produce
amounts of gene product feasible for commercial application, and especially
diagnostic, therapeutic and drug discovery uses.

Any of the methods can further comprise introducing double-strand breaks
into the genomic DNA in the cell prior to or simultaneously with vector
integration.

The invention also encompasses novel vector constructs for activating
gene expression or over-expressing a gene through non-homologous
recombination. The novel construct lacks homologous targeting sequences. That
is, it does not contain nucleotide sequences that target host cell DNA and
promote homologous recombination at the target site, causing over-expression
of a cellular gene via the introduced transcriptional regulatory sequence.

Novel vector constructs include a vector containing a transcriptional
regulatory sequence operably linked to an unpaired splice donor sequence and
further containing an amplifiable marker.

Novel vector constructs include constructs with a transcriptional
regulatory sequence operably linked to a translational start codon, a signal
secretion sequence, and an unpaired splice donor site; constructs with a
transcriptional regulatory sequence operably linked to a translation start
codon

The methods are also capable of producing over-expression of known

A "known" gene relates to the level of characterization of a gene. The
invention allows expression of genes that have been characterized as well as of
genes that have not been characterized. Different levels of characterization
are possible. These include detailed characterization, such as cloning, DNA,
RNA, and/or protein sequencing, and relating the regulation and function of the
gene to the cloned sequence (e.g., recognition of promoter and enhancer
sequences, functions of the open reading frames, introns, and the like).
Characterization can be less detailed, such as having mapped a gene and related
function, or having a partial amino acid or nucleotide sequence, or having
purified a protein and ascertained a function. Characterization may be minimal,
as when a nucleotide or amino acid sequence is known or a protein has been
isolated but the function is unknown. Alternatively, a function may be known
but the associated protein or nucleotide sequence is not known or is known but
is not related to the function. Finally, there may be no characterization in
that both the existence of the gene and its function are not known. The
invention allows expression of any gene at any of these or other specific
degrees of characterization.

Many different proteins can be activated or over-expressed by a single
activation construct and in a single set of transfections. Thus, a single cell
or different cells in a set of transfectants (library) can over-express more
than one protein following transfection with the same or different constructs.
Previous activation methods require a unique construct to be created for each
gene to be activated.

Further, many different integration sites adjacent to a single gene can be
created and tested simultaneously using a single construct. This allows rapid
determination of the optimal genomic location of the activation construct for
protein expression.
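
The comparison of integration sites can be pictured as a simple selection over
measured clones. The following Python sketch assumes, purely for illustration,
that each clone has been assigned an integration position relative to the gene
and a measured expression level; the function name, positions and values are
invented and carry no experimental meaning.

```python
def best_integration_site(expression_by_site):
    """Return the (position, expression) pair with the highest expression.

    expression_by_site maps an assumed integration position (bp relative to
    the gene, negative meaning upstream) to a measured expression level in
    arbitrary units.
    """
    if not expression_by_site:
        raise ValueError("no clones were measured")
    return max(expression_by_site.items(), key=lambda item: item[1])


# Made-up measurements from clones carrying the same construct.
measurements = {-5000: 12.0, -1200: 48.5, -300: 31.0}
site, level = best_integration_site(measurements)
print(site, level)  # prints -1200 48.5
```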

Using previous methods, the 5' end of the gene of interest had to be
extensively characterized with respect to sequence and structure. For each

into the genome of the cell by non-homologous recombination, and allowing
over-expression of the endogenous gene in the cell. The method does not require
previous knowledge of the sequence of the endogenous gene or even of the
existence of the gene. Where the sequence of the gene to be activated is known,
however, the constructs can be engineered to contain the proper configuration
of vector elements (e.g., location of the start codon, addition of codons
present in the first exon of the endogenous gene, and the proper reading frame)
to achieve [illegible] protein sequence.
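
The reading-frame requirement mentioned above can be made concrete with a small
check. This is a sketch under stated assumptions, not a description of how the
constructs are actually designed: it assumes the vector exon is given as the
sequence from the ATG start codon up to the splice donor site, and that the
phase of the downstream endogenous exon (how many nucleotides it expects to
receive from the upstream exon to complete its first codon) is already known.
Both function names are hypothetical.

```python
def leftover_nucleotides(vector_coding_seq):
    """Count nucleotides left after the last complete codon of the vector exon.

    vector_coding_seq runs from the ATG start codon to the splice donor site.
    """
    if not vector_coding_seq.upper().startswith("ATG"):
        raise ValueError("vector coding sequence must begin with an ATG start codon")
    return len(vector_coding_seq) % 3


def is_in_frame(vector_coding_seq, downstream_exon_phase):
    """Check that splicing the vector exon to the downstream exon keeps the frame.

    downstream_exon_phase is 0, 1 or 2: the number of nucleotides the
    downstream exon needs from the upstream exon to complete its first codon.
    """
    if downstream_exon_phase not in (0, 1, 2):
        raise ValueError("exon phase must be 0, 1 or 2")
    return leftover_nucleotides(vector_coding_seq) == downstream_exon_phase


print(is_in_frame("ATGGCC", 0))   # True: two complete codons, nothing left over
print(is_in_frame("ATGGCCA", 1))  # True: one leftover nucleotide matches phase 1
print(is_in_frame("ATGGCCA", 0))  # False: the frame would shift by one nucleotide
```

Because the phase of an unknown gene cannot be predicted in advance, a check
like this only applies when the sequence of the gene to be activated is known.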

The cell containing the vector is screened for expression of the gene.
The cell over-expressing the gene can be cultured in vitro so as to produce
desired amounts of the gene product of the endogenous gene that has been
activated or whose expression has been increased. The gene product can then be
isolated and purified to use, for example, in protein therapy or drug
discovery.

Alternatively, the cell expressing the desired gene product can be allowed
to express the gene product in vivo.






The vector construct can consist essentially of the transcriptional
regulatory sequence.

Alternatively, the vector construct can consist essentially of the
transcriptional regulatory sequence and the amplifiable marker.

The invention, therefore, is also directed to methods for over-expressing
an endogenous gene in a cell, comprising introducing a vector containing a
transcriptional regulatory sequence and an amplifiable marker into the cell,
allowing the vector to integrate into the genome of the cell by non-homologous
recombination, and allowing over-expression of the endogenous gene in the cell.

The cell containing the vector is screened for over-expression of the gene.
The cell over-expressing the gene is cultured such that amplification of
the endogenous gene is obtained. The cell can then be cultured in vitro so as
to produce desired amounts of the gene product of the amplified endogenous gene
that has been activated or whose expression has been increased. The gene
product can then be isolated and purified.

Alternatively, following amplification, the cell can be allowed to express
the endogenous gene and produce desired amounts of the gene product in vivo.



The vector construct can consist essentially of the transcriptional
regulatory sequence and the splice donor sequence.

The invention, therefore, is also directed to methods for over-expressing
an endogenous gene in a cell comprising introducing a vector containing a
transcriptional regulatory sequence and an unpaired splice donor sequence into
the cell, allowing the vector to integrate into the genome of the cell by
non-homologous recombination, and allowing over-expression of the endogenous
gene in the cell.

The cell containing the vector is screened for expression of the gene.
The cell over-expressing the gene can be cultured in vitro so as to produce
desirable amounts of the gene product of the endogenous gene whose expression
has been activated or increased. The gene product can then be isolated and
purified.

Alternatively, the cell can be allowed to express the desired gene product
in vivo.



The vector construct can consist essentially of a transcriptional
regulatory sequence operably linked to an unpaired splice donor sequence and
also containing an amplifiable marker.

Other activation vectors include constructs with a transcriptional
regulatory sequence and an exonic sequence containing a start codon; a
transcriptional regulatory sequence and an exonic sequence containing a
translational start codon and a secretion signal sequence; constructs with a
transcriptional regulatory sequence and an exonic sequence containing a
translation start codon and an epitope tag; constructs containing a
transcriptional regulatory sequence and an exonic sequence containing a
translational start codon, a signal sequence and an epitope tag; constructs
containing a transcriptional regulatory sequence and an exonic sequence with a
translation start codon, a signal secretion sequence, an epitope tag, and a
sequence-specific

dihydrofolate reductase (DHFR), adenosine deaminase (ada), dihydro-orotase,
glutamine synthetase (GS)

The vector may contain eukaryotic viral origins of replication useful for
gene amplification. These origins may be present in place of, or in conjunction
with, an amplifiable marker.

The vector may also contain genetic elements useful for the propagation
of the construct in micro-organisms. Examples of useful genetic elements
include microbial origins of replication and antibiotic resistance markers.

These vectors, and any of the vectors disclosed herein, and obvious
variants recognized by one of ordinary skill in the art, can be used in any of
the methods described above to form any of the compositions producible by those
methods.

Nonhomologous integration of the construct into the genome of a cell
results in the operable linkage between the regulatory elements from the vector
and the exons from an endogenous gene. In preferred embodiments, the insertion
of the vector regulatory sequences is used to upregulate expression of the
endogenous gene. Upregulation of gene expression includes converting a
transcriptionally silent gene to a transcriptionally active gene. It also
includes enhancement of gene expression for genes that are already
transcriptionally active, but produce protein at levels lower than desired. In
other embodiments, expression of the endogenous gene may be affected in other
ways such as downregulation of expression, creation of an inducible phenotype,
or changing the tissue specificity of expression.

Cells produced by this method can be used to produce protein in vitro
(e.g., for use as a protein therapeutic) or in vivo (e.g., for use in cell
therapy).

The invention also encompasses cells made by any of the above methods.
The invention encompasses cells containing the vector constructs, cells in
which the vector constructs have integrated, and cells which are
over-expressing desired gene products from an endogenous gene, over-expression
being driven by the introduced transcriptional regulatory sequence.

and/or DNA breaking agents, applied either together (to the same cells), or
separately (applied to individual groups of cells and then combining the cells
together to produce the library). The library can be composed of multiple types
of cells containing a single or multiple constructs which were integrated into
the genome of a cell treated with radiation, restriction enzymes, and/or DNA
breaking agents, applied either together (to the same cells) or separately
(applied to individual groups of cells and then combining the cells together to
produce the library).

The invention is also directed to methods for making libraries by selecting
various subsets of cells from the same or different transfection experiments.
For example, all of the cells expressing nuclear factors (as determined by the
presence of nuclear green fluorescent protein in cells transfected with
construct 20) can be pooled to create a library of cells with activated nuclear
factors. Similarly, cells expressing membrane or secreted proteins can be
pooled. Cells can also be grouped by phenotype, for example, growth factor
independent growth, growth factor independent proliferation, colony formation,
cellular differentiation (e.g., differentiation into a neuronal cell, muscle
cell, epithelial cell, etc.), anchorage independent growth, activation of
cellular factors (e.g., kinases, transcription factors, nucleases, etc.), gain
or loss of cell-cell adhesion, migration, or cellular activation (e.g., resting
versus activated T cells).
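
Pooling cells by phenotype amounts to a group-by over scored clones. The sketch
below assumes each clone has already been assigned a single phenotype label by
some assay; the clone identifiers and labels are hypothetical examples.

```python
from collections import defaultdict


def group_by_phenotype(scored_clones):
    """Pool clone identifiers by phenotype label.

    scored_clones is an iterable of (clone_id, phenotype) pairs.
    """
    pools = defaultdict(list)
    for clone_id, phenotype in scored_clones:
        pools[phenotype].append(clone_id)
    return dict(pools)


# Made-up assay results for three clones.
observed = [
    ("c1", "nuclear GFP"),
    ("c2", "anchorage independent growth"),
    ("c3", "nuclear GFP"),
]
pools = group_by_phenotype(observed)
print(pools["nuclear GFP"])  # prints ['c1', 'c3']
```

Each resulting pool corresponds to one sub-library, for example the cells with
activated nuclear factors.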

The invention is also directed to methods of using libraries of cells to
over-express an endogenous gene. The library is screened for the expression of
the gene and cells are selected that express the desired gene product. The cell
can then be used to purify the gene product for subsequent use. Expression in
the cell can occur by culturing the cell in vitro or by allowing the cell to
express the gene in vivo.

The invention is also directed to methods of using libraries to identify
novel genes and gene products.

The invention is also directed to methods for increasing the efficiency of 
gene activation by treating the cells with agents that stimulate or effect the 



